A Unifying Framework for Learning Bag Labels from Generalized Multiple-Instance Data

نویسندگان

  • Gary Doran
  • Andrew Latham
  • Soumya Ray
چکیده

We study the problem of bag-level classification from generalized multiple-instance (GMI) data. GMI learning is an extension of the popular multiple-instance setting. In GMI data, bags are labeled positive if they contain instances of certain types, and avoid instances of other types. For example, an image of a “sunny beach” should contain sand and sea, but not clouds. We formulate a novel generative process for the GMI setting in which bags are distributions over instances. In this model, we show that a broad class of distribution-distance kernels is sufficient to represent arbitrary GMI concepts. Further, we show that a variety of previously proposed kernel approaches to the standard MI and GMI settings can be unified under the distribution kernel framework. We perform an extensive empirical study which indicates that the family of distribution distance kernels is accurate for a wide variety of real-world MI and GMI tasks as well as efficient when compared to a large set of baselines. Our theoretical and empirical results indicate that distribution-distance kernels can serve as a unifying framework for learning bag labels from GMI (and therefore MI) problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Re-Ranking of Images from the Web using Bag based Method

An image retrieval system is a computer system for browsing, searching and retrieving images from a large database of digital images. Given a textual query in traditional text based image retrieval (TBIR),relevant images are to be re ranked using visual features after the initial text based image search. In this paper, we propose a new bag based re ranking framework for large scale TBIR. We com...

متن کامل

A Unifying Framework for Learning Bag Labels from Generalized Multiple-Instance Data: Supplementary Materials

wi = (∑ xj∈B 1 [(xi, xj) ∈ G(B)] )−1 . That is, wi is the reciprocal of the number of instances adjacent to xi in the graph G(B) of bag B. G(B) contains an edge for every pair of instances whose distance is less than some threshold τ and includes self-edges. Under the view of bags as distributions, mi-Graph can be viewed as performing the mean embedding on a weighted sample, or a sample drawn f...

متن کامل

Deep Multiple Instance Learning for Zero-shot Image Tagging

In-line with the success of deep learning on traditional recognition problem, several end-to-end deep models for zero-shot recognition have been proposed in the literature. These models are successful to predict a single unseen label given an input image, but does not scale to cases where multiple unseen objects are present. In this paper, we model this problem within the framework of Multiple ...

متن کامل

Review of Multi-Instance Learning and Its applications

Multiple Instance Learning (MIL) is proposed as a variation of supervised learning for problems with incomplete knowledge about labels of training examples. In supervised learning, every training instance is assigned with a discrete or real-valued label. In comparison, in MIL the labels are only assigned to bags of instances. In the binary case, a bag is labeled positive if at least one instanc...

متن کامل

Multiple-Instance Active Learning

We present a framework for active learning in the multiple-instance (MI) setting. In an MI learning problem, instances are naturally organized into bags and it is the bags, instead of individual instances, that are labeled for training. MI learners assume that every instance in a bag labeled negative is actually negative, whereas at least one instance in a bag labeled positive is actually posit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016